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INTRODUCTION: Before the 1800s, dogs were 
probably primarily selected for functional roles 
such as hunting, guarding, and herding. Modern 
dog breeds are a recent invention defined by con- 
formation to a physical ideal and purity of lineage. 
Breeds are commonly ascribed temperaments 
and behavioral proclivities based on the purported 
function of the ancestral source population. By 
extension, the breed ancestry of individual 
dogs is assumed to be predictive of tempera- 
ment and behavior. Through our community 
science project Darwin’s Ark (darwinsark.org), 


A Biddability 


we enrolled a diverse cohort of pet dogs to 
explore how genetics shapes complex behavioral 
traits in this exceptional natural model. 


RATIONALE: Dogs are a natural system for investi- 
gating the genetics of complex traits. Millions of 
pet dogs live in human homes, sharing our envi- 
ronment, and receive sophisticated medical care. 
Behavioral disorders are treated with human 
psychiatric drugs, achieving similar response 
rates, and genetic studies suggest shared etiology 
with some human psychiatric conditions. 


How readily dog responds to human direction, especially in the context of training 
eight questions; defined by factor analysis of complete surveys for 10,253 dogs 


Heritability 
(1910 sequenced dogs) 
30.5 + 8.5% (+SE) 


B Top 10 most 
differentiated breeds: 


Independent 


Basset hound 


Biddable [aust Independent 


Cc 
<B> ee. 


Alaskan malamute 
Shiba inu 
Miniature pinscher 
@ Great pyrenees 
@) Siberian husky 

@) Beagle D 
@) West highland white terrier 
© Dachshund 

O Yorkshire terrier 


Biddable 


Belgian malinois 
Vizsla 
Border collie 
Australian cattle dog 
German shepherd dog 
Australian shepherd 
Golden retriever 
Bernese mountain dog 
Catahoula leopard dog 
Labrador retriever 


Sprocket 


Permutation z-score 


@e- 00860 


-4-20 2 4 p<0.05 


Lily 


Mixed-breed dogs with 
border collie ancestry (N = 638) 


— Border collies 


SS ’ 
Caboose 


TWooie 


0 
Factor score 


Effect of breed on behavior. (A) Biddability is among eight behavioral factors defined from surveys. 

SE, standard error. (B) Dogs in some breeds tend to score unusually high or low for this factor compared 
with dogs overall. (© and D) Border collies score lower on average for biddability (vertical line at median) 
but vary widely (C), including genetically confirmed border collies (D). (E) In mixed-breed dogs, border collie 
ancestry has a small genetic effect on biddability. [Photo credits: K. Wirka (Sprocket); M. Wisniewski 
(Caboose); B. Hoadley (Molly); M. Logsdon (Hunter); A. Macias (Lily); S. Staples (TWooie)] 
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We developed Darwin’s Ark as an open data 
resource for collecting owner-reported pheno- 
types and genetic data and invited any dog 
owner to enroll their dog. We paired this with 
low-pass sequencing to capture nearly all com- 
mon variation in this outbred population. Our 
inclusive approach achieved the large samples 
needed to investigate complex traits. 


RESULTS: We surveyed owners of 18,385 dogs 
(49% purebred) and sequenced the DNA of 
2155 dogs. Most behavioral traits are heritable 
[heritability (h7) > 25%], but behavior only subtly 
differentiates breeds. Breed offers little predic- 
tive value for individuals, explaining just 9% 
of variation in behavior. For more heritable, 
more breed-differentiated traits, like biddabil- 
ity (responsiveness to direction and commands), 
knowing breed ancestry can make behavioral 
predictions somewhat more accurate (see the fig- 
ure). For less heritable, less breed-differentiated 
traits, like agonistic threshold (how easily a dog 
is provoked by frightening or uncomfortable 
stimuli), breed is almost uninformative. 

We used dogs of mixed breed ancestry to test 
the genetic effect of breed ancestry on behavior 
and compared that to survey responses from 
purebred dog owners. For some traits, like bid- 
dability and border collie ancestry, we confirm a 
genetic effect of breed that aligns with survey 
responses. For others, like human sociability 
and Labrador retriever ancestry, we found no 
significant effect. 

Through genome-wide association, we found 
11 regions that are significantly associated with 
behavior, including howling frequency and hu- 
man sociability, and 136 suggestive regions. 
Regions associated with aesthetic traits are 
unusually differentiated in breeds, consistent 
with a history of selection, but those associated 
with behavior are not. 


CONCLUSION: In our ancestrally diverse cohort, 
we show that behavioral characteristics ascribed 
to modern breeds are polygenic, environmentally 
influenced, and found, at varying prevalence, in 
all breeds. We propose that behaviors perceived 
as characteristic of modern breeds derive from 
thousands of years of polygenic adaptation that 
predates breed formation, with modern breeds 
distinguished primarily by aesthetic traits. By 
embracing the full diversity of dogs—including 
purebred dogs, mixed-breed dogs, purpose-bred 
working dogs, and village dogs—we can fully 
realize dogs’ long-recognized potential as a 
natural model for genetic discovery. 
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Behavioral genetics in dogs has focused on modern breeds, which are isolated subgroups with distinctive 
physical and, purportedly, behavioral characteristics. We interrogated breed stereotypes by surveying 
owners of 18,385 purebred and mixed-breed dogs and genotyping 2155 dogs. Most behavioral traits 
are heritable [heritability (h2) > 25%], and admixture patterns in mixed-breed dogs reveal breed 
propensities. Breed explains just 9% of behavioral variation in individuals. Genome-wide association 
analyses identify 11 loci that are significantly associated with behavior, and characteristic breed 
behaviors exhibit genetic complexity. Behavioral loci are not unusually differentiated in breeds, but breed 
propensities align, albeit weakly, with ancestral function. We propose that behaviors perceived as 
characteristic of modern breeds derive from thousands of years of polygenic adaptation that predates 
breed formation, with modern breeds distinguished primarily by aesthetic traits. 


odern dog breeds are less than 160 years 

old (~50 to 80 generations), a blink in 

evolutionary history compared with the 

origin of dogs more than 10,000 years 

ago (J, 2) (Fig. 1A). Prehistoric wolves 
likely adapted to use human refuse through 
changes in morphology, behavior, metabolism, 
and reproduction (3-7). Early humans may have 
given favored dogs increased access to limited 
resources, but there is little evidence of humans 
intentionally breeding dogs until 2000 years ago 
(8, 9). By contrast, the modern dog breed, em- 
phasizing conformation to a physical ideal and 
purity of lineage, is a Victorian invention (0). 
Before the 1800s, dogs were primarily selected 
for functional roles such as hunting, guarding, 
and herding (11)—heritable behaviors derived 
from the wolf predatory sequence (12). Modern 
breeds retain these component motor patterns, 
but their contexts, sequences, and thresholds 
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vary (12, 13). The extent to which ancient be- 
havioral propensities persist in modern breeds, 
defined primarily by aesthetics and often dis- 
connected from functional behavioral selec- 
tion, is unclear. 

Dogs with ancestry from a single modern 
breed (purebred dogs) predominate in genetic 
studies, which capitalize on their unusual pop- 
ulation history and limited genetic diversity 
(14-16), but are a minority of all dogs (3, 17). 
More than 80% of the nearly 1 billion dogs on 
Earth are free-living, free-breeding, and not 
under human control (e.g., village dogs) (3). 
Even in countries with large purebred pop- 
ulations, dogs with ancestry from more than 
one breed are common [~50% in the United 
States (78)]. Herein, we use the word “mutt” 
to describe a dog with ancestry from more 
than one breed, and potentially from non- 
breed populations. 

Modern breeds are commonly ascribed char- 
acteristic temperaments (e.g., bold, affectionate, 
or trainable), and behavioral proclivities on the 
basis of their purported ancestral function (e.g., 
herding or hunting) (19, 20). By extension, the 
breed ancestry of an individual dog is assumed 
to be predictive of temperament and behavior 
(21), with dog DNA tests marketed as tools for 
learning about a dog’s personality and training 
needs (22). Studies, however, found that within- 
breed behavioral variation approaches levels 
similar to the variation between breeds (23, 24), 
suggesting that such predictions are error 
prone even in purebred dogs. 

Behavioral traits in dogs are also a poten- 
tially powerful natural model for human neu- 
ropsychiatric disease. Pet dogs are regularly 
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treated with human psychiatric drugs, includ- 
ing selective serotonin reuptake inhibitors, 
and have similar response rates, and genetic 
studies suggest shared etiology (25-29). Dog 
behavioral traits are polygenic, driven by many 
small effect loci and the environment (30, 31). 
Given this genetic complexity, the success of 
survey-based phenotypes for mapping com- 
plex human diseases (32, 33), and the avail- 
ability of well-validated dog-owner surveys 
(34-37), a large-scale, survey-based study de- 
sign is ideal for investigating the genetics of 
canine behavior. 

Through our community science project 
Darwin’s Ark (darwinsark.org), we enrolled a 
diverse cohort of pet dogs to explore the com- 
plicated, and sometimes unexpectedly weak, 
relationship between breed and behavior. We 
show that using ancestrally diverse dog cohorts 
enables more powerful studies of behavioral 
genetics in this notable natural model. 


Results 
Survey data 


We developed Darwin’s Ark as an open data 
resource for collecting owner-reported pheno- 
types and genetic data in dogs. Dog owners were 
asked to complete 12 short surveys (117 ques- 
tions) on behavioral and physical traits (Fig. 1, 
B and C; figs. S1, S2A, and S3; and table 
$1). Darwin’s Ark surveys, each with no more 
than 10 questions, are designed to be easy to 
complete. Owner survey responses are suscep- 
tible to rater bias, including the influence of 
breed stereotypes. 

To ascertain size, we asked whether the 
dog is ankle-high, calf-high, knee-high, thigh- 
high, or hip-high rather than requiring own- 
ers to measure their dog (fig. S2A). This simple 
question proved effective when validated in 
three ways: (i) We measured dogs [N = 38 dogs; 
Pearson correlation coefficient (Rpearson) = 0.86; 
p =3 x10); (ii) owners measured their dogs 
(N = 337 dogs; Rpearson = 0.845 p = 6 x 1079); 
and (iii) we tested correlation with breed aver- 
age height (38) in the subset of dogs that were 
purebred (N = 2025; Ryearson = 0.85; p < 2.2 x 
107'°) (Fig. 1D, fig. $2, and data $1). 

Owners answered, on average, 100 + 34 (+SD) 
questions per dog; 70% of dogs have answers 
for more than 95% of questions (22) (Fig. 1, E 
and F). For the 48 questions drawn from the 
Dog Personality Questionnaire, between- 
question correlations matched published re- 
sults (22, 37) [Mantel’s correlation coefficient 
(r) = 0.95, p = 1 x 10”; fig. SI]. 

Overall, the Darwin’s Ark cohort (NV = 18,385; 
85% from the United States) is broadly similar 
to the US dog population. Half (49.2%) are re- 
ported as purebred (/8), and breed frequencies 
(Fig. 1G) correlate with US breed popularity 
(39) (Rpearson = 0.88; p = 1.48 x 10~*”) (table S2). 

To reduce the dimensionality of the sur- 
vey data, we performed exploratory factor 
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Fig. 1. The ancestry-inclusive Darwin’s Ark project collected surveys and genetic 
data from a diverse cohort of dogs. (A) Selection on behavior in dogs predates 
modern breeds, which have existed for ~1% of dog history (10). (B and C) Surveys 
include (B) 79 published behavior questions (34-37) and 39 new questions about 
heritable, easy-to-identify traits that fall into (C) four categories that potentially vary in 
heritability, including physical traits (fig. S3). (D) Owner responses to the size question 
(fig. S2) were highly correlated with measured size in 375 dogs (Pearson correlation). 
Boxes and whiskers represent 25% quartile, 75% quartile, minimum, and maximum, 
with horizontal line at median. (E) Owners of most dogs answered >95% of questions. 


(F) Upset plot visualizing the dataset. Six horizontal bars show the number of dogs 
subsetted by information type or breed category (“subsets”). Each column is a possible 
intersection of subsets, with black circles indicating the inclusion of a subset and 
vertical bars showing the number of dogs intersecting. Four rows (‘‘pools”) represent 
the dogs used in four major analyses. Plus signs denote the inclusion of an intersection, 
with total number of dogs (N) on the right. (G) The frequency of breeds among 
purebred dogs in Darwin's Ark (left), compared with the 14 most common breed 
ancestries we detected in all dogs through global ancestry inference (middle). More 
popular breeds tended to be guessed more frequently by MuttMix participants (right). 


analysis (22), defined eight factors that ex- 
plain a cumulative 24.3% of variance in be- 
havior (fig. S4 and tables S3 and S4), and 
scored 16,522 dogs. We named each factor 
for the facet of behavior captured and es- 
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tablished standard terms for describing low- 
and high-scoring dogs (Fig. 2A). In subse- 
quent analyses, we examined the influence 
of breed and genetics on each behavioral fac- 
tor (Fig. 2B). 
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Genetic data 

Nearly half of the Darwin’s Ark cohort are 
mutts, an understudied population with un- 
characterized genomic diversity. We whole- 
genome sequenced (mean coverage = 46x; 
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Fig. 2. Behavioral traits 
do not define breeds 

the way aesthetic traits 
like size do. (A) Explor- 
atory factor analysis 

bins behavioral questions 
into eight inferred factors, 
which correspond to latent 
behavioral propensities 
(blue, negative score; 

ed, positive score). (B) Ina 
series of seven analyses, 
we explored how behavior 
elates to breed in the con- 
text of size, which is a 
strongly breed-differentiated 
trait. For each column, 
circle size is proportional to 
the minimum-maximum 
normalized values of (i) LD- 
corrected h’syp, (ii) effect 
size of breed in ANOVA 
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range 30x to 67x) 27 pet dogs of unknown 
breed ancestry (the “Mendel’s Mutts” cohort), 
including 26 from the United States (one orig- 
inally from Mauritius and two from St. Kitts) 
and one from Ireland (data $2). We compared 
jointly called variant records in this cohort with 
published whole genomes for 530 purebred 
dogs from 128 breeds (22, 40). 

Sequencing mutts efficiently captures com- 
mon variants in the dog population, including 
variation not detected by sequencing large 
numbers of purebred dogs. Sequencing an 
additional mutt yields nearly as many new 
variants as sequencing a purebred dog, even 
when each new purebred dog is from a dif- 
ferent breed (Fig. 3A). In the 27 mutts, we dis- 
covered 11,974,853 biallelic single-nucleotide 
polymorphisms (SNPs) total, including 375,474 
variants not found in the 128 breeds (530 dogs). 

We also confirm that genetic variation private 
to a single breed is rare [38,097 + 13,206 (+SD) 
SNPs per breed after excluding Tibetan mastiffs, 
amore distantly related lineage (47) with 651,551 
private SNPs]. Breeds are not distinguished 
by asmall number of “breed-defining” variants 
(table S5). After analyzing all 13 breeds with 
more than 10 sequenced representatives, we 
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found just 332 SNPs (298 autosomal) exclusive 
to, and fixed in, a single breed (data S3) out 
of 16,702,091 SNPs total (0.002%). Tibetan 
mastiffs account for 142 SNPs (121 autosomal), 
with just 16 + 31 (+SD) SNPs (15 + 29 auto- 
somal) in each of the nine other breeds. These 
variants are unlikely to affect phenotype. An- 
notation with SNPEff classifies 98.2% (326) as 
occurring at loci without obvious function. 
Mendel’s Mutts have shorter runs of homo- 
zygosity than purebred dogs (Fig. 3B) and 
linkage disequilibrium (LD) that decays more 
rapidly (fig. S5). Thus, genotyping arrays de- 
signed with sufficient marker density for pure- 
bred dog studies miss much of the genetic 
variation in mutts. Average squared correlation 
coefficient (7°) between SNPs drops below 0.2 at 
9.8 kb, which is slightly longer than in village dogs 
(6.2 kb) but 5- to 10-fold shorter than in breeds 
(fig. S5). Because of the short LD, the markers 
included on the Illumina Canine HD Beadchip 
(N = 171,882) and the Axiom Canine Genotyping 
Array Sets A and B (NV = 1,011,992) tag only 19 and 
53% of genomic variation in mutts, respectively, 
compared with 51 and 85% in breeds (Fig. 3C). 
We adopted a low-pass sequencing and im- 
putation approach (42-46), using a reference 
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panel of 435 deeply sequenced dogs and other 
canids (data S4). We validated by resequenc- 
ing 11 mutts with high-coverage whole-genome 
sequencing (WGS) at low coverage [1.0x + 0.6x 
(+SD)]. We imputed, on average, 32,438,672 
SNPs and 13,910,371 insertions and/or deletions 
(indels) per dog, or 19.8 + 6.9 (+SD) variants per 
kilobase (~40x denser than the Axiom array), 
which was sufficient to tag nearly all the com- 
mon variants (94% tagged by a marker within 
100 kb and 87% within 1 kb) (Fig. 3C). Con- 
cordance between low-pass and 30x sequencing 
was 98.3 + 0.7% (+SD) (N = 11 dogs; ~7.7 million 
common SNPs), which was slightly lower than 
that between the Axiom array and 30x (99.3 + 
0.1%; N = 10 dogs; 0.83 million SNPs) but 
better than that between imputed array calls 
and 30x (97.3 + 0.3%; 7.6 million SNPs), in- 
cluding higher concordance for heterozygous 
genotypes (98.9 versus 98.3%) (data S5). 

Our final genetic dataset comprises 1715 
Darwin’s Ark dogs sequenced at 0.6x + 0.3x 
(+SD) coverage, each genotyped for 32,213,747 + 
141,060 (+SD) SNPs, and 440 dogs genotyped 
on the Axiom array and imputed using the 
same pipeline for 32,006,290 + 157,307 (+SD) 
SNP genotypes, for a total of 2155 dogs (data S6). 
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calls breeds comprising 
>5% ancestry accurately 
but misses lower-frequency 
breeds and (E) can discern 
admixture occurring up 

to ~12 generations ago 
(~24 to 36 years). Error bars 
in (E) represent standard 
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eds <5% 
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generations since 
breed admixture 


Caramel: Purebred 


fraction of ancestry 


RY 
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Hubble: F1 hybrid 


matching owner-reported breed 


%€r spaniel 


Coconut: Recently admixed Clarence: Highly admixed 


# breeds detected at >5% 


deviation across 10 simulations of 100 admixed genomes. (F) For dogs categorized as confirmed purebred based on owner reports, breed calling assigns >85% ancestry (vertical 
dashed line) to the owner-reported breed in 90% of dogs. For candidate purebred dogs, 58% meet this criterion (4.4% had no detectable ancestry from the owner-reported 
breed). Just 5% of dogs categorized as mutts have >85% ancestry from their most common breed (blue dashed line). (G) Most mutts have ancestry detected (>5%) from 
more than three breeds (1205 dogs total). (H) Examples of breed calling in four dogs with different ancestry types: Caramel, a purebred dog, who has 93% of her ancestry 
assigned to her owner-reported breed; Hubble, an F; goldendoodle; Coconut, who has apparent mutt ancestry mixed with dalmatian; and Clarence, a mutt with <45% ancestry 


from any one breed (*Staffordshire bull terrier). [Photo credits: M. Movassagh (Caramel); J. Luban (Hubble); A. Pensarosa (Coconut); R. Skloot (Clarence)] 


We selected dogs for sequencing on the basis of 
the enrollment date and survey completion rate 
(7.4% of dogs had sequencing funded by the 
owner’s donation). 


Breed ancestry assignment 


In our genetic data, owner-reported breed is 
a reasonable proxy for predominant genetic 
ancestry. We developed a breed-calling pipe- 
line using the software ADMIXTURE (22, 47) 
to infer ancestry using a supervised analysis 


Morrill et al., Science 376, eabk0639 (2022) 


and a reference panel of 101 breeds (12 dogs 
per breed; 688,060 SNPs) collated from public 
and Darwin’s Ark data (Fig. 3, D and E; figs. S6 
and S7A; and table S2). Genetically inferred 
breed ancestry across dogs correlated well 
with the proportions of dogs registered to 
breeds in the American Kennel Club (93 breeds, 
Roearson = 0.745 p = 2.9 x 10°") (Fig. 1G). 

The top breed that was called matched the 
owner-reported breed in 98.7% of dogs de- 
scribed as registered purebred (N = 304) and 
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in 85.8% of all dogs for which owners report 
just a single breed (NV = 885) (table S6). Dogs 
described as registered purebreds vary some- 
what in the percent ancestry assigned to the 
owner-reported breed (potentially because the 
reference data are not representative of all 
diversity in the breed or because of shared 
ancestry between breeds). We empirically set 
the threshold for defining a dog as genetically 
“purebred” as when 85% of ancestry is inferred 
to come from a single breed because 90% of 
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dogs described as registered purebred by own- 
ers fell above this threshold (Fig. 3F). 

We designated three classifications of breed 
ancestry: (i) “confirmed purebred dogs” were 
either described as registered purebred by the 
owner or confirmed by sequencing (3637 dogs), 
(ii) “candidate purebred dogs” included all 
confirmed purebred dogs and dogs with owner- 
reported ancestry from one breed (9009 dogs), 
and (iii) “mutts” were all other dogs (9376 dogs) 
(Fig. 1F). Genetically inferred ancestry super- 
seded owner-reported breed when both were 
available, although discrepancies were rare 
(15 out of 556 candidate purebreds, 3 out of 
323 confirmed purebred) and were primarily 
nominal variations on the same breed (e.g., 
Landseer versus Newfoundland). Extrapolat- 
ing from the subset of dogs with genetic data, 
89.7% of registered purebred and 58.2% of dogs 
with owner-reported ancestry from one breed 
would, if sequenced, have >85% ancestry called 
from their owner-reported breed (Fig. 3F). Be- 
cause confirmed purebred dogs have a subs- 
tantially higher percentage of their ancestry 
assigned to their owner-reported breed, in sub- 
sequent analyses, we prioritized the confirmed 
set or, when the larger and more diverse can- 
didate set was useful, validated findings in the 
confirmed set. 

Mutts are rarely (17%) mixes of just two 
breeds. Most (66%) carry >5% ancestry from 
four or more breeds (Fig. 3G). We find that 
1071 dogs (70%) are highly admixed, carrying 
under 45% ancestry from any one breed (Fig. 
3H). The most common breed ancestry (data 
S7) is American pit bull terrier (9.9%) followed 
by Labrador retriever (6.0%), Chihuahua (5.1%), 
beagle (4.1%), and German shepherd dog (4.0%) 
(Fig. 1G and fig. S7C), varying by geographic 
region (fig. S7D). Purebred dogs had higher 
coefficients of inbreeding, as estimated from 
the proportion of the genome in runs of homo- 
zygosity [Frou = 0.06 + 0.04 (+SD); N = 633] 
than mutts (FRoy = 0.02 + 0.02; N = 1221) [for 
Student’s ¢ test p value (Dt-test) = 1.7 x 10°”? t = 
28.4, degrees of freedom (df) = 776.8] (fig. S7B). 


Heritability of surveyed traits 


Combining genetic and survey data for 1967 
dogs, we found that genetic variation explains 
more than 25% of the variation in factor scores 
for human sociability, toy-directed motor pat- 
terns, and biddability (responsiveness to com- 
mands), as well as in responses to 38 of 83 
(46%) behavioral questions and eight physical 
traits. We estimated SNP-based heritability 
(h? snp) With standard errors using restricted 
maximum likelihood (48) and LD score cor- 
rection (excluding 27 questions for which more 
than half of LD-stratified variance components 
were constrained) on genetic relationship 
matrices calculated for dogs with genetic data 
[8,518,951 autosomal SNPs with minor allele 
frequency (MAF) >2%] (data S8). Physical traits 
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are exceptionally heritable, with five out of 
eight exceeding 85% heritability. Retrieving is 
the most heritable behavioral trait [52.5 + 9.2% 
(+SE)], and human sociability is the most heri- 
table factor (factor 1, 67.3 + 13.0%). Behaviors 
related to intrinsic motor patterns and physical 
traits are more heritable than other behaviors 
(Fig. 4A). To assess whether these heritabilities 
are overestimated because of correlation be- 
tween traits and breed ancestry, we recalculated 
them by incorporating the top 10 principal com- 
ponent eigenvectors of genetic variance (49, 50). 
For the most part, we saw little change [median 
fold change of +0.02, 25% quartile = —0.049 
and 75% quartile = +0.140] between estimates 
(Rpearson = 0.97, p = 1.1 x 10-°*). Heritability 
decreases the most for biddability (factor 4, 
drops from 30.5 + 8.5% to 20.0 + 8.8%) and 
“circles before pooping” [question 64 (Q64), 
drops from 25.1 + 8.1% to 8.0 + 7.6%] (fig. S8). 


Breed explains some behavior variance 


In the owner surveys, breed explains a larger 
fraction of the variance in behavior pheno- 
types (110 questions and eight factors) than 
size, sex, or age, but the effect is relatively 
small (Fig. 4, B and C; fig. S9A; and data S9). In 
an analysis of variance (ANOVA) of confirmed 
purebred dogs representing 78 breeds, the 
breed effect, measured as generalized eta 
squared (ges) (51), averages 0.089 + 0.039 (+SD) 
(range 0.034 to 0.253), correlates with herita- 
bility (Rpearson = 0.89; p = 7.9 x 10) (fig. S9B), 
and is about fivefold higher for the physical 
traits characteristic of breeds than for behav- 
ioral traits (fig. S9C). The same analysis using 
the less stringent “candidate purebred” breed 
definition is nearly perfectly correlated with 
the confirmed purebred analysis (Rpearson = 
0.99, p = 5.2 x 10°; N = 125), with ges values 
~30% lower (mean ratio = 0.70 + 0.11) (fig. S9D). 

Age explains little of the variation [0.018 + 
0.035 (+SD)] overall, but for a subset of traits 
it exceeds 0.05, including two factors (arousal 
level and toy-directed motor patterns) and 
nine questions, which include five designed to 
assess aging-related traits (36) (Fig. 4C and fig. 
S9E). Sex has little effect (0.009 + 0.044), 
except for “lifts leg to urinate” (Q66; ges = 
0.48). Size has virtually no effect (6.6 x 10°* + 
8.6 x 10°*; range 2.5 x 107” to 0.006). 


Breed is not a reliable predictor of 
individual behavior 


For several factors, score distributions for in- 
dividual breeds differ from the distribution of 
all dogs (fig. S10), with at least a few breeds 
over- or underrepresented in the highest- 
scoring quartile (fig. S11 and data S10). These 
distributions are based on owner survey data 
that may be influenced by breed stereotypes 
and other factors, and differences are not nec- 
essarily genetic in origin. For example, for 
human sociability (factor 1), an individual 


29 April 2022 


Labrador retriever (1.4-fold), golden retriever 
(1.6-fold), American pit bull terrier (1.4-fold), 
or Siberian husky (1.7-fold) was more likely to 
score in the highest quartile than a randomly 
selected dog, whereas a German shepherd dog 
(0.78-fold), Chihuahua (0.72-fold), or dachshund 
(0.56-fold) was less likely. Even so, in every 
breed represented by 25 or more dogs, the 
majority scored within one SD of the Darwin’s 
Ark cohort mean (67.2 + 7.5% within one SD 
and 95.4 + 3.0% within two SD for confirmed 
purebred dogs). Behavioral factors show high 
variability within breeds, suggesting that al- 
though breed may affect the likelihood of a 
particular behavior to occur, breed alone is not, 
contrary to popular belief, informative enough 
to predict an individual’s disposition. 

We developed an interactive dashboard 
(https://darwinsark.org/muttomics) to illus- 
trate the value offered by breed for predicting 
behavior in any individual dog. For example, 
the chance that an owner scores an individual 
dog in the highest quartile for human socia- 
bility increases from 22% for a mutt to 40% if 
that dog is a golden retriever (fig. S11). Users 
can select one or a combination of character- 
istics, and the site dynamically updates to show 
the frequency in 23 breeds and in mutts. 


Measuring breed peculiarity 


We developed a permutation-based approach 
to measure when dogs of a particular breed 
are described by owners as having behavioral 
characteristics that significantly differentiate 
them from other dogs. For each phenotype, 
we compared dogs within each breed to dogs 
sampled randomly from the full cohort, pro- 
ducing a “population peculiarity score” (PPS) 
(22). We tested both “confirmed” breeds (sam- 
ple 7 = 50; up to six breeds) and “candidate” 
breeds (sample n = 25; up to 62 breeds) (Fig. 4, 
D and E; figs. S12 and S13A; and data S11). 

Overall, breeds were only subtly differ- 
entiated on behavioral phenotypes. In the 
confirmed purebreds, only 5.1% (30/583) of 
breed-phenotype pairs were significantly dif- 
ferentiated for behavioral questions, compared 
with 41.5% (17/41 pairs) for physical traits. 
Scores for behavioral questions were not more 
correlated with each other than were scores for 
physical questions (table $7). Intrinsic motor 
patterns and physical traits tend to be slightly 
more breed differentiated (fig. S13B). 

No behaviors are exclusive to any breed (fig. 
S14). Even in the breed with the lowest howling 
propensity, confirmed Labrador retrievers (Q17; 
N = 241; 78.4% never howl), 8% of owners 
report that their Labrador howls sometimes, 
often, or always. Although 90% (53/57) of con- 
firmed greyhounds are reported to never bury 
their toys (Q29), owners described three dogs 
as frequent buriers. 

We used the same permutation approach 
to measure how behavior changes as dogs age 
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Fig. 4. Breed and age 
(and not size) have 
subtle effects on the 
behavior traits surveyed. 
(A) LD-corrected SNP- 
based heritability is much 
higher for physical traits, 
and somewhat higher 

for motor patterns, 
compared with other 
behavioral traits 
(significance measured 
using Student's t test; 
Benjamini-Hochberg 
(BH)-adjusted p value is 
shown). Shaded regions 
indicate the probability 
density. (B and ©) ANOVA 
in confirmed purebred 
dogs shows that (B) 

the effect size of breed 
exceeds 15% for 6/7 
physical traits and some 
behavioral questions 
(labeled bars) and that 
(C) for factors, breed 
explains more variation in 
scores than age and sex, 
and size has no significant 
effects. In (C), asterisks 
indicate statistical signifi- 
cance (BH-adjusted p < 
0.05). (D and E) Permu- 
tations comparing breeds 
to randomly sampled dogs 
show that (D) many 
breeds are significantly 
differentiated on physical 
traits but that differentia- 
tion is much rarer for 
behavioral traits, including 
intrinsic motor patterns 
and (E) the eight behav- 
ioral factors. The selection 
of the most popular 
and/or most differentiated 
breeds is shown, with 

full results in fig. S12. 
Breeds in italics are 
epresented by fewer 
than 50 individuals and 
skew toward more 
extreme Z-scores (gray 
background; difference in 
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mean score = 0.29: pi-test = 2 x 10°”). (F) The effect of age, compared with the effect of breed, in the ANOVA shows that age explains nearly as 
much variance as breed for factor 2. (G) The PPS for arousal level and toy-directed motor patterns are significantly correlated with age, whereas biddability is more 
breed driven. Asterisks indicate significant results. 


(fig. S13C). Most questions (72%) and fac- 
tors (63%) are correlated with age [false dis- 


most composed breed (Great Pyrenees), and 
puppies are far more toy-directed in their 


Testing breed stereotypes 
PPSs aligned, to a limited degree, with be- 


covery rate (FDR) p value (pgpr) < 0.05] (figs. 
S15 and S16). Older dogs score nearly as com- 
posed in their arousal level (factor 2) as the 
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motor patterns (factor 3) than one of the 
most toy-directed breeds (German shepherd 
dog) (Fig. 4, F and G). 
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havioral stereotypes described in the breed 
standards (data S12). The American Kennel Club 
(AKC) describes each breed with a three-word 
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phrase (e.g., border collies are “affectionate, 
smart, energetic” and beagles are “friendly, 
curious, merry”) (52) (table S2). Breeds de- 
scribed with particular words are not behav- 
iorally distinct from other breeds; however, 
“charming” tends to describe breeds that are 
less toy directed (factor 3; DrpR = 0.039) (fig. 
$17). Grouping breeds by their proposed his- 
toric working role, as captured by AKC show 
groups (53), finds that four out of six show 
groups are peculiar on at least one factor 
(Fig. 5A). Herding breeds are more toy directed, 
more biddable, more engaged, and more aloof, 
whereas toy breeds are more independent and 
less dog social. Sporting breeds are more toy 
directed, and working breeds are more dog 
social (Fig. 5B and fig. SI8A). 

We found more support for breed behav- 
ioral stereotypes when comparing the PPS 
results to quantitative rankings from the 
Encyclopedia of Dog Breeds for each breed 
on 10 behavioral characteristics (19) (fig. 
S17B). Nine of the 10 correlated significantly 
with PPS for at least one factor. Breeds ranked 
high on “ease of training” tended to be more 
biddable (factor 4) and more toy directed 
(factor 3). Breeds ranked as low on “energy 
level” scored as more composed, more dog 
social, and less environmentally engaged 
(factors 2, 6, and 7). “Watchdog ability” and 
“friendliness towards strangers” both corre- 
lated with human sociability (factor 1), but in 
opposite directions. 

Overall, when comparing breeds to all pet 
dogs, breed differences based on owner re- 
ports align with some breed behavioral ste- 
reotypes, with one major caveat. Using survey 
data alone, we cannot distinguish environ- 
mental effects, including the effects of the 
stereotypes themselves (e.g., by influencing 
owner’s perception of their dog’s behavior), 
from genetic effects. 


Human perception of breed in mutts 


Half of the Darwin’s Ark cohort were mutts, 
offering an opportunity to test whether breed 
stereotypes have a genetic etiology. In pure- 
bred dogs, cultural breed stereotypes affect the 
perception of a dog’s behavior and thus may 
alter a dog’s environment (54, 55) or introduce 
rater bias into owner survey responses. If breed 
ancestry is not readily discernible in mutts, 
these nongenetic factors would be mitigated, 
allowing us to discern the genetic effects of 
breed from human perception and other envi- 
ronmental factors. 

To measure how accurately breed can be 
discerned from physical characteristics in 
mutts, we ran a 2-month community science 
project, MuttMix (muttmix.org) that recruited 
26,639 participants (Fig. 5C). For 30 mutts 
with complex genetic ancestry (i.e., no first- 
generation crosses) (22), we asked participants 
to guess their top three breeds. Between 
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13,662 and 14,160 participants submitted re- 
sponses for each dog. They accurately iden- 
tified, on average, 20.9 + 20.4% (+SD) of each 
dog’s breed ancestry (fig. SI9A). Breeds com- 
prising a smaller proportion of a dog’s ances- 
try were especially challenging to identify 
(Rpearson = 0.61; p = 3.3 x 107°) (fig. S19B). 
Thus, any effect of perceived ancestry on 
survey responses is likely to be substantially 
mitigated in mutts. 

The physical characteristics associated 
with a breed, like short fur (American pit bull 
terrier), short legs (dachshund), or pricked 
ears (Chihuahua), influenced how participants 
guessed, but this is an error-prone approach 
(Fig. 5, D to F, and fig. S19C). Dogs with an- 
cestry from more popular breeds had more 
correct guesses because participants tended 
to guess popular breeds more frequently (with 
the exception of the underguessed American 
pit bull terrier) (Fig. 1G and fig. S20). To control 
for this, we calculated how often we would 
expect to see each possible combination of 
breed guesses if breed guess rates matched 
the population frequencies in Darwin’s Ark 
(table $2) and compared the observed rate of 
correct guesses to the expected rate. Partici- 
pants guessed correctly more often than ex- 
pected for 73% of dogs (Fig. 5C and fig. S21A). 
The number of correctly guessed breeds by 
each participant for each dog was slightly 
higher for self-described dog professionals 
LN = 84,918; mean = 0.93 (SD 0.74)] than 
nonprofessionals [N = 333,614; mean = 0.81 
(SD 0.72)] (fig. S21B). 


Effect of breed ancestry in mutts 


We measured whether breed influences be- 
havior through genetics by examining only 
mutts with <45% ancestry from any single 
breed. This analysis illustrates the power of in- 
cluding complex mixes in behavioral studies, 
because any influence of breed stereotypes is 
mitigated when true breed ancestry is not read- 
ily discernible from appearance. We built linear 
mixed-effects regression (LMER) models for 
all factors and questions (breed ancestries as 
fixed effects; age and pairwise genetic related- 
ness as random effects) (22) (Fig. 6, A to C; fig. 
$22; and data S13). The proportion of variance 
in factor scores captured by genetic breed 
ancestry (marginal R”) averaged 9 + 3% (+SD), 
suggesting a weak but discernible genetic 
effect of breed on disposition (fig. S23B and 
data S14). Breed ancestry explained, on aver- 
age, 20 + 12% (+SD) of variance for physical 
traits in mutts. 

We validated the LMER method by confirm- 
ing that the ancestry effects for physical traits 
matched the breed standards for physical 
appearance (38). For example, ancestry from 
nine breeds (six with long fur and three with 
short fur) had a significant effect on fur length 
in mutts, and for each breed, the direction of 
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effect matches the breed standard. In total, 
we assessed 51 breed-trait pairs for four traits 
(size, white coat color, ear shape, and fur length). 
The direction of effect matched in 50 out of 
51 (table S8). For the size question (Q121), the 
LMER score is strongly correlated with the 
breed average height (Rpearson = 0.86; p = 2.3 x 
10°°; N = 19). 

Correlation between the LMER results and 
the PPSs confirm that some behavioral dif- 
ferences in mutts derive from differences in 
breed ancestry (NV = 6333; Rpearson = 0-28; p = 
18 x 10°”) (fig. S24 and data S15). For ex- 
ample, mutts with more border collie ancestry 
tend to be more biddable (factor 4; LMER t¢ = 
-4.6; DepR = 0.0002), consistent with survey 
data for confirmed border collies [PPS z = 
-4.6; corrected p value (Deorr) = 2 x 107°). 
Similarly, mutts with more Labrador retriev- 
er ancestry tend not to avoid getting wet 
(Q60; LMER ¢ = 3.8; Depr = 0.003), like many 
confirmed Labrador retrievers (PPS z = 4.3; 
Pcorr = 0.003). 

Discordance between the LMER results and 
breed differentiation measured by PPS may 
capture nongenetic influences on survey re- 
sponses such as breed stereotypes. Owners 
of confirmed golden retrievers, for example, 
tend to disagree that their dog is fearful of 
unfamiliar people (Q46; PPS z = 4.6; Deorr = 
0.002), which fits the breed stereotype that 
golden retrievers are friendly to strangers (19). 
In mutts, however, golden retriever ancestry 
had no effect on this question (LMER ¢ = 0.3; 
Prpr = 0.88), suggesting that the reported 
propensity may not be driven by genetics (fig. 
$23A). Similarly, whereas owners of confirmed 
Labrador retrievers tend to describe their dogs 
as more human social (factor 1; PPS zg = 3.3; 
Deorr = 0.006), in line with the breed stereotype 
(“friendly, active, and outgoing”), in mutts, 
Labrador retriever ancestry has little effect 
(LMER t = 0.45 Depr = 0.90). Owners of con- 
firmed border collies tend to score their dogs 
higher on “wants to play” (Q2; PPS zg = -3.6; 
Deorr = 0.04); this is consistent with the 
stereotype that border collies are “affection- 
ate, smart, and energetic” but discordant 
with the LMER results, which find no effect 
of border collie ancestry (LMER ¢ = 0.089; 
Prpr = 0.97) 

By analyzing the effect of ancestry on be- 
havior in mutts, we can anticipate the likely 
behavioral propensities of breeds that are not 
well represented in our survey data (table S9). 
For example, Saint Bernard ancestry corre- 
lates with being more affectionate (factor 8, 
LMER ¢ = -4.1; Depr = 0.002) and Shar-Pei 
ancestry with being less toy directed (factor 3; 
LMER ¢ = 4.6; Prpr = 0.0002) (Fig. 6B). An- 
cestry from Chesapeake Bay retrievers increases 
with propensity to damage doors (Q40; LMER 
t = 4.2; p = 0.001) and escape from enclosures 
(Q35; LMER t = 3.5; Depr = 0.02). 
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Fig. 5. Breed stereotypes can be assessed in mutts, where environmental 
effects (e.g., owner perception) are mitigated by the difficulty of accu- 
rately discerning the breed. (A) Breeds grouped by purported historic working 
roles are more differentiated (measured by mean PPS) on some factors than 
other breeds (58 breeds total), measured as a t statistic using Student's t test. 
Error bars represent 95% confidence intervals. (B) Toy and herding breeds 
illustrate groups with shifts in opposite directions (full results in fig. S18). 
Points are PPS for breed, vertical lines are mean PPS for group, boxes enclose 
the 25 to 75% quartiles, and horizontal lines extend from 1.5 times the 
interquartile range (IQR) below the 25% quartile to 1.5 times the IQR above 
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entropy explained by phenotypes in mutts 


the 75% quartile. Arrows indicate the direction of change in means, and words 
show the favored behavioral propensity. Herding breeds are Australian cattle 
dog, Australian shepherd, Belgian malinois, border collie, Catahoula leopard dog, 
collie, German shepherd dog, Pembroke Welsh corgi, and Shetland sheepdog. 
Toy breeds are bichon frise, Cavalier King Charles spaniel, Chihuahua, Havanese, 
Maltese, miniature pinscher, papillon, Pomeranian, pug, shih tzu, toy poodle, 
and Yorkshire terrier. (C) For each dog in the MuttMix survey, the ratio of 
observed:expected correct guesses for (i) one or more or (ii) two or more of the 
highest-content breed ancestries (blue indicates a ratio >1; open circles are 
not significant). (D) For six dogs that have a similar amount of genetic ancestry 
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detected from American pit bull terrier (~25 to 30%), participants guessed this 
breed at rates ranging from 1 to 60%. [Photo credits: J. O'Donnell (Jack); T. Fortier 
(Rosie); A. Phelps (Reilly); L. Moses (Rudy); R. Skloot (Clarence); M. Bishop (Esme)] 
(E) For three individual mutts, the most guessed breeds (top) differ from the 

its: E. Winchester (Maxine); 


genetically inferred breed ancestry (bottom). [Photo cred 


Genome-wide association studies in all dogs 

We first investigated breed-defining physical 
traits with known large-effect loci using a mixed 
linear model-based approach for genome- 
wide association (56) across 8,518,951 SNPs 
of >2% MAF. We controlled for population 
and family structure and cryptic relatedness 
in our complex cohort (600 purebred dogs, 
representing 88 breeds, and 1496 mutts) using 
a genetic relationship matrix in a mixed- 
model framework. None of the genome-wide 
association studies (GWASs) had unusual ge- 
nomic inflation [mean inflation factor (Agco.5) = 
0.985 + 0.016 (+SD); range 0.960 to 1.03; N = 
14], suggesting that the mixed-model frame- 
work controls for confounding due to popula- 
tion structure and other factors (57). 

We successfully replicated 17 published as- 
sociations for physical traits other than size 
(table S10), including for genes MITF (14) 
with white spotting [p = 2.89 x 107°”; SNP ef- 
fect size (b) = -0.78], FGF5 and RSPO2 (58) 
with coat length (p = 5.46 x 10°"; b = +0.37) 
and texture (p = 6.35 x 107°; b = +0.11), USH2A 
(59) with roan and/or ticking (p = 5.31 x 10°"; 
b = +0.20), RUNX3 (60) with pheomelanin in- 
tensity (p = 4.11 x 1078; b = -0.20), and the 
6-defensin region (67-63) with brindle coat 
patterns (p = 2.50 x 1071; b = +0.35) (fig. S25, 
D to I, and data S16). 

For size, a quantitative trait, we replicated 
10 previously published associations (40, 64-71) 
(Fig. 6D, fig. S25A, and table S10) and found 
new associations to SARIB [p = 2.01 x 107°; b = 
+0.12; metabolic disorders (72, 73)] and ANAPCI 
[p = 4.11 x 107°; b = +0.15; short stature in 
Rothmund-Thomson syndrome (74)]. By com- 
paring giant dogs (NV = 55) and then tiny dogs 
(N = 55) to average dogs (N = 1841), we dis- 
tinguished variants associated with gigan- 
tism (fig. S25B) and dwarfism (fig. S25C) 
specifically. The FGF4 retrogene locus, pre- 
viously associated with chondrodysplasia (67), 
is more strongly associated in the tiny GWAS 
(Dan = 1.16 x 107°; Diiny = 1.15 x 10°°°), dwarf- 
ing all other loci. 

The height associations were robust even 
in the absence of purebred dogs, suggesting 
that the all-mutt GWAS might offer equivalent 
power to one that includes purebred dogs. In 
dogs carrying less than 45% ancestry from any 
breed, a cohort with about half as many dogs 
(970 versus 1951), we identified all the major 
stature-associated loci (Rpearson = 0-91; p < 
1x 10-8) as well as a new association in LRIG3 
(p = 7.29 x 10 "°; b = -0.31), a gene involved in 
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physical characteristics inf 


leave-one-out analysis (full 
likely guessed Irish wolfhou 


bone morphogenetic protein-mediated body- 
size regulation (75) (fig. S25J). 

Genomic predictions for height based on 
the GWAS-identified variants perform well 
in both purebred dogs and mutts, reflecting 
the strong selection on size among dog breeds. 
For a random forest regression model built 
using 1730 dogs and 2733 size-associated SNPs 
(p < 1x 10°°) (22), predictions carried a mean 
squared error of 0.3 (fig. S26) and 66% of pre- 
dictions fell within +0.5 units of the relative 
size score (fig. S2) (Rpearson = 0.77, D = 3.90 x 
10-°°°), with no drop in accuracy for predic- 
tions made on mutts [predicted and true values 
differed by 0.46 + 0.35 (+SD) in purebreds 
versus 0.43 + 0.36 (+SD) in mutts; Di-test = 
0.08, ¢ = 1.75, df = 832]. Randomly selected 
SNPs, by comparison, performed poorly, with 
a mean squared error of 0.5 (45% of predic- 
tions within +0.5 units). Predictions for relative 
stature validated well against more precise 
measurements taken in person (N = 310 dogs; 
Rpearson = 0.91, p = 8.8 x 10°") (Fig. 6E and 
fig. S27). 


Behavioral GWASs 


Applying the same GWAS approach to the 
behavioral phenotypes identified 11 genome- 
wide significant (p < 5 x 107*) (76) and 136 sug- 
gestive (p < 1 x 10~®) associations (data S16). 
As with physical traits, the behavioral GWAS 
had minimal genomic inflation [mean Agco,5 = 
0.995 + 0.0087 (+SD); range 0.976 to 1.05; N = 
118]. The associations for behavioral traits were 
weaker, consistent with a more complex ge- 
netic architecture. They have not yet been in- 
dependently replicated. The most significant 
association, to “gets stuck behind objects” 
(Q36), mapped to a 380-kb region (p = 8.36 x 
10"; b = +0.54) (Fig. 6F and fig. $25K) con- 
taining SNX29, a gene associated with cogni- 
tive performance in human GWASs (77-79). 
“Dog howls” (Q17) mapped to an intergenic 
region (p = 9.63 x 10°"; b = +0.54) (Fig. 6G and 
fig. S25L) between SLC38A1I and SCN3A, a 
voltage-gated sodium channel involved in the 
development of speech and language (80). The 
top association to a behavioral factor was for 
human sociability (factor 1), downstream of 
the gene HACDI (p = 2.41 x 10°°; b = -0.36) 
(Fig. 6H and fig. S25M), a regulator of long- 
term memory (87) that is also associated with 
centronuclear myopathies (82). 

In our diverse cohort with dense genotyp- 
ing data, associated regions are smaller than 


R. Bacon (Jack); E. Stackpole (Bella)] (F) The dogs in (E) illustrate how a mutt’s 


uence participant breed guesses. Points show entropy 


explained by traits using guesses for all mutts (22), and bars span values from a 


results in fig. S18). For example, 67% of participants 
nd for Maxine because of her coat furnishings. 


with sparser marker sets. We compared our 
behavior-associated regions to those found in 
an earlier study of a different complex trait 
(osteosarcoma) at the same linkage threshold 
(r? > 0.8). In the Darwin’s Ark GWAS, asso- 
ciated regions extend to a median 5.6 kb (25 to 
75% quartile = 2.0 to 14 kb, mean 16.8 kb) 
around suggestive (p < 1 x 10°°) behavioral 
loci and 5.7 kb at physical trait loci (1.4 to 22 kb, 
mean 26.2 kb). By contrast, intrabreed GWASs 
of osteosarcoma in three breeds with diverse 
population structures mapped at median ranges 
of 86 kb (25 to 75% quartile = 57 to 162 kb) in 
racing greyhounds, 54 kb (21 to 409 kb) in 
rottweilers, and 1 Mb (743 kb to 1.4 Mb) in Irish 
wolfhounds (83). This increased resolution may 
facilitate the search for causal variants. In the 
Darwin’s Ark GWAS, we can distinguish a 
region on chromosome 10 that is associated 
with stature (76.2 kb at r? > 0.8; HMGAQ; p = 
1.84 x 10°“; b = -0.31) from one associated 
with ear shape (118.7 kb at r? > 0.8; MSRB2; 
p = 6.02 x 10°”; b = -0.33) that were pre- 
viously linked in interbreed GWASs (71, 84) 
(fig. S28). 

The mixed-model association approach may 
not fully control for spurious association that 
arises when a trait differs between breeds. An 
association for “focused in distracting situa- 
tions” (Q21) (chr32:4,512,005; p = 1.0 x 10-8; 
b = -0.22) (fig. S25N) mapped to a locus con- 
taining FGF5, a gene associated with long- 
coated breeds (58). This association was lost 
when we conditioned on the top coat length- 
associated SNP (chr32:4,509,367; p = 0.0001; 
b = -0.15) (fig. S29), which is linked to the top 
focus-associated SNP (r? = 0.33). The original 
association likely reflected the spurious differ- 
ence in focus scores between dogs with shorter 
and longer coats (Pi-test = 0.00023; 2012 dogs). 
Pleiotropy is unlikely because fur length ex- 
plains almost no variation in focus scores 
(ANOVA ges = 0.0004; p = 0.35; N = 2456). 
Consistent with this, the focus association 
on chromosome 32 weakens (chr32:4,512,005; 
p =12 10°; b = -0.18) when we include the 
top 10 SNP-based principal components in the 
mixed model (fig. S30). 

To assess whether spurious breed-trait cor- 
relations are a major confounder in our analy- 
ses, we reran all GWASs and included the top 
10 principal components in the mixed model. 
Only 6% (3/48) of our genome-wide significant 
associations were lost (p > 1 x 10~°) (data S16). 
We also tested whether the top 75 regions asso- 
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Fig. 6. Genetics of aesthetic and 
behavioral traits in dogs and 
influence of breed ancestry in 
mutts. (A to C) In highly admixed 
dogs with no breed ancestry over 
45%, the fixed effects of breed ancestry 
on (A) physical traits, (B) behavioral 
factor scores, and (C) individual 
behavioral question scores are shown. 
(D) Manhattan plot for the GWAS of 
surveyed height (Q121) from 1951 dogs, 
including covariates for age and sex. 
Linkage blocks (r° > 0.2) associated 

(p <5 10°) with stature align 

with previous associations for body 
size in (a) IGFIR (68), (b) LCORL (40), 
(c) GHR (69), (d) SMAD2 (84), 

(e) HMGA2 (69) and the nearby 

(f) MSRB3, (g) a chromosome 12 
retrogene insertion of FGF4 (70), 

(h) IGF1 (64-66), (i) another FGF4 
retrogene on chromosome 18 (67), 

(j) MEDIS3L (40), and (k) IGF2BP2 (40). 
Two previously unknown associations 
were found spanning JADE2 and SARIB, 
and in ANAPCI. (E) Random forest 
models based on size-associated 

SNPs (p < 1 10°) accurately predict 
body size and correlate strongly 

(N = 310 dogs: Ryearson = 0.91, 
p=88x 10" t= 37.451, df = 308) 
with real measurements in those 
dogs. (F to H) Regional association 
plots for (F) scores on Q36 “gets stuck 
behind objects,” (G) human sociability, 
and (H) frequency of howling from 
Q17. In addition to protein coding 
genes (black boxes), we also show 
representative open chromatin regions 
(rOCRs; narrow vertical lines). We 
annotated rOCRs genome-wide using 
ENCODE methods (101) applied to 
canine ATAC-seq (assay for transposase- 
accessible chromatin using sequencing) 
data from 14 tissues (102) and mam- 
malian sequence constraint (103). 

(I) Breeds show high genetic differenti- 
ation, measured as the population 
branch statistic, overlapping physical 
trait associated loci compared with 
00,000 randomly permuted regions 
(N = 1232, mean z = 0.49, p = 7.3 x 
10°). Regions associated with behav- 
oral factors (N = 512, mean z = 0.03, 
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p = 0.224) and question scores (N = 9317, mean z = 0.00, p = 0.603) do not show such differentiation in breeds. Red circles indicate mean, with horizontal lines at the 25% 


quartile, median, and 75% quartile. The shad 


trait) were enriched for SNPs associated in 
any of the 119 behavioral GWASs (fig. S31). 


Only one GWAS (Q66, “lifts leg to urinate”) 


was significant [adjusted p value (p,q) = 0.013]. 
Thus, although spurious associations due to 
aesthetic traits are a concern in multibreed 
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ed area is the probability density, with significant different 


Unattributed heritability 


A large proportion of the genetic and envi- 
ronmental contributions to behavior remains 


GWASs, they are likely rare in the GWAS run 
on our diverse cohort. 


iation in red. ns, not significant; ****p < 0.0001 (Student's t test). 


undiscovered. The SNPs that we found to be 


associated with heritable (R’ snp > 0.1) behav- 
ioral traits account for a smaller proportion 
of overall heritability than do aesthetic trait 


associations (22), consistent with a complex 
genetic architecture. For the 14 physical traits, 
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53.0 + 30.2% (+SD) of heritability is attribut- 
able to associated SNPs (p < 1 x 10°), but for 
the eight behavioral factors and 73 questions, 
this drops to 21.0 + 12.8% and 27.9 + 20%, 
respectively. The six associated loci accounted 
for 42.7% of the genetic component of dog 
sociability (h?syp = 14.8 + 6.1%), whereas just 
4.3% of highly heritable human sociability 
(h> snp = 41.5 + 9.1%) could be explained by 
its single associated region. 


Brain-expressed genes enriched in 
behavior GWASs 


Regions associated with dog behavioral phe- 
notypes are enriched in brain-expressed genes. 
We cataloged the genes expressed in 38 tissue 
types, including 13 brain regions, using human 
GTEx data (85), an approach used previously 
in dogs (86). We also collated genes from 
curated lists for obsessive-compulsive disorder 
(OCD) (87), autism-spectrum disorders (88), 
and schizophrenia (89, 90). Using MAGMA 
(22, 91), we tested all GWASs for enrichment 
(data S17). Regions associated with toy-directed 
motor patterns (factor 3) had the strongest 
enrichments, which were for genes expressed 
in the hippocampus and in the basal ganglia of 
the nucleus accumbens, caudate, and putamen. 
Associations for “not keen on new situations” 
(Q84) were enriched for hypothalamus- 
expressed genes (fig. S30). Overall, enrichments 
in genes associated with neuropsychiatric con- 
ditions were weak, peaking for the enrichment 
of human OCD genes in Q84-associated regions 
(p = 0.0012; paaj = 0.24). 


Aesthetic selection predominates in breeds 


Associations to physical traits, but not be- 
haviors, tend to overlap signals of genetic dif- 
ferentiation in modern breeds, suggesting 
that aesthetics, and not behavior, has been the 
focus of selection. We tested for sites with ex- 
cess differentiation in each breed with >12 dogs 
using the population branch statistic (PBS) 
test (92), using all dogs (NV = 3802 to 3878) and 
wolves (N = 48) as the two outgroups across 
~27.6 million SNPs from publicly available and 
our genetic data (data S4 and S6). Among 
the top 0.1% of breed-differentiated regions 
(26 + 6 regions per breed), we validated ge- 
netic signals of selection reported at EPASI, 
for hypoxia tolerance, in Tibetan mastiffs (93); 
at CACNAIA, unknown phenotype, in two sled 
dog breeds (94); at ESRI, unknown pheno- 
type, in long-legged sighthounds (40); and at 
ALX4, a blue eye color gene, in Siberian huskies 
(95) (data S18). 

We used permutation (22) to test whether 
PBS scores are unexpectedly high in regions 
associated with traits (data S19) and found 
that, whereas physical trait-associated regions 
are more differentiated, those associated with 
behavioral traits are not (mean 2 = 0.491 versus 
-0.001; Di-test = 4 x 10) (Fig. 61). Considering 
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all moderately associated GWAS regions (p < 
1x 10°), 25 of 65 (39%) physical trait loci are 
unusually differentiated, whereas only 38 of 
515 (7%) behavioral trait loci are, and a sub- 
set of those are also connected to physical 
traits (data S20). Differentiation at physical 
trait loci is consistent with ongoing selection 
to meet strict morphometric standards in 
breeds (38), and the lack of overlap for most 
behavioral traits suggests weaker or absent 
selection. 

The lack of differentiation at behavioral loci 
is not inconsistent with heritable behavioral 
differences in breeds, which may reflect ge- 
netic drift or selection that predates breed 
creation, neither of which the PBS test is 
designed to detect. To this point, neither of 
the two loci associated with howling are dif- 
ferentiated in either the Siberian huskies or 
beagles, even though ancestry from these 
breeds influences howling propensity. 


Discussion 


Behavioral traits are subtly differentiated in 
modern breeds (Fig. 2B). Furthermore, breed 
offers only modest value for predicting the 
behavior of individual dogs. For more herita- 
ble and more breed-differentiated traits, like 
biddability (factor 4), knowing breed ancestry 
can make behavioral predictions somewhat 
more accurate in purebred dogs. For less her- 
itable, less breed-differentiated traits, like 
agonistic threshold (factor 5), which measures 
how easily a dog is provoked by frightening, 
uncomfortable, or annoying stimuli, breed is 
almost uninformative. 

In our ancestrally diverse cohort, we show 
that behavioral characteristics ascribed to 
modern breeds are polygenic, environmen- 
tally influenced, and found, at varying preva- 
lence, in all breeds. They likely naturally arose 
over millennia as dogs followed human mi- 
grations and adapted to new human technol- 
ogies (2). The tight bottlenecks that established 
modern breeds captured ancient variation, at 
varying frequencies, with subsequent genetic 
drift or selection further shaping modern breeds 
(Fig. 3, A and B). 

We found no evidence that the behavioral 
tendencies in breeds reflect intentional selec- 
tion by breeders (Fig. 61) but cannot exclude 
the possibility. Current datasets are too small 
to detect more subtle, recent directional selec- 
tion, which requires hundreds of thousands 
of samples (96). In dogs, breed demographic 
history makes detecting selection particularly 
challenging (J, 97). 

Canine behavioral disorders are a proposed 
natural model for human neuropsychiatric dis- 
eases (25, 27). Here, we show that large-scale 
behavior GWASs in dogs are tractable, identi- 
fying dozens of loci associated with behavioral 
traits in dogs. These associations explain a 
fraction of overall heritability, suggesting that 
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still-larger sample sizes are needed. Our study 
design, combining owner-engagement with 
low-pass sequencing (45), makes this eminently 
achievable. We anticipate that this approach 
will be even more powerful once methods for 
accurately assigning local ancestry in indi- 
viduals with >100 potential source popula- 
tions (compared with two or three in human 
studies) are validated and incorporated into 
dog GWASs (98). 

As dog studies grow in scale and complexity, 
it is crucial that we meet the standards of sta- 
tistical rigor developed by the human genetics 
community and carefully account for con- 
founding by artificial selection for aesthetic 
extremes in modern breeds (99), which can 
create misleading signals of association. One 
approach for studying behavior in dogs has 
been to compare breeds, rather than indi- 
viduals, using breed-level behavioral pheno- 
types. The wide variability in behavior within 
breeds, and the potential for spurious correla- 
tions with breed-defining aesthetic traits, sug- 
gests that any discoveries made using this 
approach should be carefully validated using 
other methods. 

To date, dog genetics has focused on mod- 
ern breeds, which capture just a tiny fraction 
of global canine diversity. Although this made 
early genomic studies feasible (14), it limits 
discovery today (J00). By embracing the full 
diversity of dogs, including purebred dogs, 
mutts, purpose-bred working dogs, and vil- 
lage dogs, we can fully realize dogs’ long- 
recognized potential as a natural model for 
genetic discovery. 


Materials and methods summary 


Materials and methods described in full de- 
tail can be found in the supplementary mate- 
rials (22). 


Survey data collection 


We collected consent, profile information, and 
surveys for 18,385 dogs enrolled by their own- 
ers via the Darwin’s Ark platform (https:// 
darwinsark.org) on or before 15 November 2019. 
Profile information included the dog’s approx- 
imate birth date, sex and sterilization status, 
suspected or known breed(s), purebred registra- 
tion, and/or photograph. We collected 12 sur- 
veys, including 11 about behavior (10 questions 
each) and one about physical characteristics 
(eight questions), for a total of 118 survey items 
(table S1). All responses to survey questions 
were time stamped, and ages at the time of 
survey were calculated relative to reported 
birth date (22). 

The 110 behavioral questions all used a five- 
point Likert scale: (i) 81 questions had options 
of strongly agree, agree, neither agree nor 
disagree, disagree, or strongly disagree; and 
Gi) 29 had options of never, rarely, sometimes, 
often, or always. We sourced 79 behavioral 
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questions from published and validated sur- 
veys: (i) Dog Personality Questionnaire (DPQ/ 
DPQL; 45 questions) (37); (ii) Canine Health- 
related Quality of Life Survey (CHQLS; 11 ques- 
tions) (36); ii) Dog Impulsivity Assessment 
Scale (DIAS; 18 questions, including one also 
in DPQ) (34); and (iv) Canine Cognitive Dys- 
function Rating scale (CCDR; six questions) 
(35). We validated the performance of be- 
havioral surveys using a Mantel’s test on the 
inter-item correlation distance (d = 1 - |r|) 
matrices between published data for 48 DPQ 
items (NV = 2556 dogs) and our data. We in- 
cluded 31 new behavior questions developed 
with input from canine behavior professionals 
in the International Association of Animal 
Behavior Consultants. The physical character- 
istics survey used a variety of response types 
(table S1). Answers of “I’m not sure,” “I don’t 
know,” “not sure,” and “surgically cropped ears” 
(Q125) were excluded. 

Dog size was measured through Q121: “When 
DOG is standing next to someone of average 
height, how high are HIS shoulders?” This 
question was validated in three ways (fig. S2 
and data S1): (i) owners were provided with 
a measuring tape by mail and instructed to 
measure the height from their dog’s shoulder 
to the ground using the provided measuring 
tape (337 dogs); Gi) dogs were measured 
(height to withers) by professionals during the 
2017 Somerville Dog Festival in Somerville, 
MA (38 dogs); and (iii) owner-reported size 
was compared with average breed height 
(2025 purebred dogs). 

We performed exploratory factor analysis 
on the behavioral surveys (10,253 dogs with 
responses for all 110 questions) and extracted 
the optimal number of factors as estimated by 
the Horn’s parallel analysis and optimal coor- 
dinates heuristic methods (20 factors; table $3). 
A varimax orthogonal rotation was applied to 
generate a structure matrix with factor load- 
ings for each item, and items with low pat- 
tern or structure loadings (less than +0.3) 
were removed. We generated factor scores for 
6269 additional dogs with responses to >80% 
of questions by populating missing responses 
through random sampling. The dog’s age for 
each factor is the mean age for all responses to 
included questions. 


Sample collection 


Animal study protocols for saliva and blood 
collection from dogs were approved by the 
UMass Chan Medical School Institutional 
Animal Care and Use Committee (IACUC) 
(no. A-2520-18). We sent or gave owners saliva 
collection kits (DNA Genotek PG-100 saliva 
swabs) for sampling. For a subset of dogs, 
owners provided blood collected by their 
veterinarian. We selected dogs for sequencing 
primarily based on survey completeness and 
enrollment date. Of 1715 samples submitted 
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for low-coverage DNA sequencing, 159 samples 
(7.4% of 2155 dogs in the genetic dataset) were 
funded by owner donations to the Darwin’s 
Ark Foundation, a 501(c)(3) nonprofit orga- 
nization (82-3942341). 


High-coverage genome sequencing and analysis 


We performed high-coverage [45x + 10x 
(+SD)] WGS on samples from 27 putatively 
mixed-breed dogs (the Mendel’s Mutts cohort) 
(data $2). For the initial 22 mutts sequenced, 
we performed joint variant calling with pub- 
licly available data for 620 other dogs and 
34 canids (data S4) using the Genome Analysis 
Toolkit (GATK3) (22) on the CanFam3.1 refer- 
ence assembly. The final variant call file con- 
tained 34,191,821 SNPs and 11,943,064 indels. 
For the five mutts sequenced later, genotypes 
were called for the same set of variants using 
GATK3 HaplotypeCaller. 

We compared cumulative variant discovery 
using purebred versus mutt genomes using 
chromosome 13 as a random proxy for the 
whole genome. We tested six cohorts: one dog 
sampled at random per breed (N = 128 possible 
dogs), Mendel’s Mutts (NV = 27 dogs), and the 
four breeds with >27 individuals sequenced 
(22). We computed the cumulative distribu- 
tion of the fraction of 619,031 variants dis- 
covered using 557 purebred dogs versus using 
10 dogs randomly chosen and ordered with- 
in each cohort and computed the 95% confi- 
dence interval using random reordering within 
each cohort. 

We compared the lengths of detected runs 
of homozygosity (ROH) in mutts, dog breeds, 
and village dog genomes across biallelic SNPs 
using PLINK v1.90b6.21 with a minimum length 
of 100 kb and 100 SNPs, with at least 1 kb 
per SNP (22). We then randomly sampled 7 = 
464 runs (the mean number of ROH detected 
per mutt) from the pool of ROH detected in 
mutts, purebred dogs, and village dogs, re- 
sampling N = 100 times. 

We measured LD in mixed-breed dogs 
(Mendel’s Mutts), breeds (golden retriever, 
Labrador retriever, Leonberger, and Yorkshire 
terrier), village dogs, and wolves by randomly 
sampling 25 dogs from each cohort and, for 
20,000 randomly sampled biallelic SNPs, 
measuring 7” to all SNPs within 100 kb. We 
assessed tagging of genetic variation using 
genotyping arrays by measuring 7? between 
the same set of random SNPs and the sub- 
set of SNPs on the array (171,882 for the 
Illumina HD Canine Genotyping Array and 
1,011,992 for the Axiom Canine Genotyping 
Array Sets A and B). 


Low-coverage sequencing and imputation 


We piloted a low-pass sequencing and im- 
putation approach (42-46) using a panel of 
reference haplotypes from high-coverage 
whole-genome sequences. Autosomal variant 
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calls were inferred directly from sequencing 
reads through Gencove loimpute software 
(46) and a panel of reference haplotypes 
from publicly available WGS data [mean cov- 
erage 22.9x (SD 14.2x)] for 435 canids (data 
S4). The imputation process generated un- 
filtered genotypes for 32,438,672 SNPs and 
13,910,371 indels with imputation genotype 
probability (GP) scores per genotype per dog. 
We validated performance by comparing 
low-pass sequencing and imputation [1.0x + 
0.6x (+SD)] to array data (Axiom array) and 
high-coverage WGS data for 11 mutts with 
high-coverage WGS at low coverage. We also 
performed down-sampling of high-coverage 
WGS and subsequent imputation by the same 
method. 

We combined low-pass sequencing data for 
1715 dogs [0.6x + 0.3x (+SD)] with data for 
440 dogs genotyped on the Axiom array and 
imputed using the same haplotype reference 
panel (excluding genotypes of GP < 0.7). Af- 
ter merging, we performed additional quality 
control based on MAF, call rate, and Hardy- 
Weinberg equilibrium and validated owner- 
reported sex (22). The final dataset included 
8,518,951 biallelic, autosomal SNPs and 2155 
dogs at a genotyping rate of 97.5% (1084 males 
and 1071 females). 


Breed ancestry assignment 


We assembled a reference panel of 101 of the 
most common dog breeds in the United States 
(table S2) using high-coverage WGS for 380 dogs 
of 74 breeds (data S4), low-coverage WGS for 
115 dogs of 54 breeds, Axiom genotyping array 
data for 109 dogs of 43 breeds, and Illumina 
CanineHD arrays for 883 dogs of 90 breeds 
(22). For each breed, we selected 12 dogs for 
inclusion, prioritizing high-density raw data 
and genetic diversity within breeds. We im- 
puted genotypes for low-density data using 
the 435-canid panel of reference haplotypes. 
We retained SNPs genotyped in more than 
80% of dogs and at a MAF of at least 5%. 
Among ancestry-informative SNPs of Hudson’s 
estimator of fixation index (Fsr) > 0.15 be- 
tween breeds, we selected a dense set of 
2,468,442 markers (7? > 0.9 within 5 kb) for 
admixture simulations and a sparser set of 
688,060 markers (7? > 0.5 within 50 kb) for 
ancestry inference. 

We used a Monte Carlo approach to gen- 
erate simulated admixed genomes of known 
ancestral haplotypes and then compared the 
breed ancestry composition with ancestry in- 
ferred using ADMIXTURE (22). We simulated 
admixed individuals through N = 15 genera- 
tions of admixtures with the following proce- 
dure: NV + 1 random individuals from different 
breeds were selected to contribute to the ad- 
mixture. With each iteration, recombination 
was simulated to incorporate a new individ- 
ual. Recombination was treated as a Poisson 
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event that occurred, on average, once every 
Morgan. Simulations ran on 10 independently 
drawn datasets of six dogs per reference breed 
to create 1000 admixed individuals of known 
ancestry. We inferred global ancestry for sim- 
ulated individuals using the supervised mode 
of ADMIXTURE (random seed = 43) and the 
reference genotypes from six dogs reserved 
from each breed. 

We then performed supervised admixture 
analysis of the Darwin’s Ark genetic cohort. 
Genotype data from all query dogs was merged 
with all reference-breed data and filtered 
for SNPs in the global breed ancestry panel. 
Global ancestry from the 101 reference breeds 
were inferred using the supervised mode of 
ADMIXTURE (random seed = 43) that was 
supplied with reference population assign- 
ments. Population weights less than 1% were 
discarded from individual ancestry results. 

We combined breed ancestry assignments 
with survey data for dogs without genetic 
data to define three breed sets as decribed in 
the results: confirmed purebred dogs, candi- 
date purebred dogs, and mutts. 


Heritability analysis 

We estimated the SNP-based heritability 
(h’ snp) of surveyed traits using restricted 
maximum likelihood (REML) analysis imple- 
mented in the genome-wide complex trait 
analysis (GCTA, version 1.92.3 beta 3) software 
tool (56). We calculated LD scores in 250-kb 
regions using a block size of 10,000 kb with 
an overlap of 5000 kb between blocks. We 
generated a genetic relationship matrix (GRM) 
for the genetic cohort of 2155 dogs, as well as 
multiple GRMs calculated from SNPs strati- 
fied into LD score quartiles (22). The four LD- 
stratified GRMs were used to run REML 
analysis (GREML-LDMS) and estimate h7 snp 
with standard errors (data S8). 


Population peculiarity scoring 


We applied a custom permutation-based anal- 
ysis (22) to test whether groups of dogs de- 
fined by breed or age differed significantly in 
survey responses from randomly sampled 
groups on any survey item or factor. We in- 
cluded all dogs with any survey responses. 
For each permutation and a given sample 
size N (table S14), we calculated the mean 
(the observed test statistic) for each normal- 
ized survey response or factor score for N dogs 
sampled from among dogs of each grouping. 
For each permutation, we also calculated the 
mean for a random sampling of size N from 
the full dataset (the permuted test statis- 
tics). We counted how often the observed 
test statistics for each population were higher 
than the permuted test statistics. We ran a 
total of 500,000 permutations. To obtain the 
PPSs, we calculated the one-tailed empirical 
p values and generated zg-scores matching 
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the survey directionality. We also calculated 
the two-tailed p values corrected for multiple 
testing by a maxT procedure that preserves 
the correlational structure between survey 
items (22). 


Ancestry perception survey 


We designed the web-based MuttMix survey 
(muttmix.org) to assess perceptions of breed 
ancestry in mixed-breed dogs by nonowner 
observers. Participants self-identified as either 
general public or dog professional (yes or 
no to “Do you work with dogs professionally 
and/or are you a breeder?”). The survey con- 
sisted of 30 mixed-breed dogs with ancestry 
assignments and one purebred dog. Owners 
provided front and side photographs and a 
short video. Owners reported the dog’s rela- 
tive size (fig. S2F) and other physical descrip- 
tors. The images and information that were 
provided were shared with participants, who 
were asked to guess, for each dog, the three 
breeds detected in largest proportion (22). 
The survey launched on 16 April 2018 and 
closed on 16 June 2018, and responses were 
collected from 26,639 people over a 2-month 
period. 

We compared breed guesses to genetically 
inferred breed ancestry (22). Any breed call 
below 5% was removed and only breeds 
offered as survey options were examined. To 
calculate the average total percentage of an- 
cestry guessed correctly, we first calculated the 
percentage guessed correctly by each user for 
each dog by summing the percent genetic 
ancestry attributed to their top three breed 
guesses. To assess the accuracy of user guesses 
of breed ancestry, we first counted the number 
of breed guesses for a given dog that were 
among the top two or three breeds that were 
genetically detected. 

We measured how specific physical attrib- 
utes affected participants’ breed choices using 
entropy analysis (22). For each breed option, 
we calculated how well mutts’ phenotypes, 
defined binarily for each of eight different 
traits (height, leg length, ear type, coat type, 
coat length, coat furnishings, white spotting, 
and pigmentation), distinguished between 
participant guesses of presence versus absence 
of each ancestry. We applied a leave-one-out 
analysis, omitting guesses for each mutt in 
series, to assess the impact of guesses for each 
mutt on entropy reduction. To calculate sig- 
nificance, we randomized trait assignments 
across mutts and then asked whether entropy 
reductions from true traits were greater than 
those randomly assigned. 

We calculated how often we expected to see 
each possible combination of breed guesses by 
chance, assuming the guess rate for each breed 
to be the overall frequency of that breed (22) 
(table $2). We then calculated the observed 
rate of guesses with 1+, 2+, and 3 breeds cor- 
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rect for each dog and then calculated the ratio 
of the observed-to-expected rate. 


LMER models 


To measure the relationship of genetic breed 
ancestry to physical and behavioral pheno- 
types, we constructed LMER models using all 
dogs with <45% ancestry from any single breed 
(1002 dogs total). We treated normalized ques- 
tion and factor scores as independent varia- 
bles, breed ancestry as fixed effects, and age 
as random effect. For each survey item and 
factor, we built a model with REML to obtain 
unbiased estimates, standard deviations, and 
Wald statistics (t.val) for the fixed effects of 
breed on factor score and performed ANOVA 
to obtain the breed F statistics. To obtain the 
likelihood ratio for each breed, we constructed 
models using maximum likelihood with and 
without the breed and performed an ANOVA. 


GWASs using mixed linear models 


We performed genome-wide mixed linear 
model-based associations in the Darwin’s 
Ark genetic cohort using the “leave-one- 
chromosome-out” approach (MLMA-LOCO) 
implemented in GCTA (56) with categorical 
covariates for sex and data type (genotyping 
or low-pass sequencing) and quantitative co- 
variates for height and age for nonmorpho- 
logical traits. Because LD is nearly as short 
in diverse dogs as in humans, we used the 
thresholds for genome-wide significance (p = 
5 x 10°) and suggestive associations (p = 1 x 
10°) that are conventionally used in human 
GWASs (J, 76). 

We defined regions of association by clump- 
ing SNPs in LD (7° > 0.2 and 7” > 0.5) and near 
(<250 kb) associated index SNPs using PLINK 
(data S16). When comparing region sizes to the 
earlier osteosarcoma study (83), we used the 
same clumping thresholds. To assess how much 
phenotypic variance was explained by associ- 
ated regions, we derived genetic relationship 
matrices for regions of suggestive association 
(p = 1x 10°) with each trait and the set of all 
other SNPs and estimated the partitioned heri- 
tability as the proportion of total heritability 
unattributed by discovered associations. 

We built a predictive model for height as 
responses to survey Q121 for 1730 dogs older 
than 18 months and assessed its power through 
10-fold cross validation (9/10 training, 1/10 test- 
ing). At each round, we performed GWASs on 
the training set, selected SNPs for prediction 
at given p value cutoffs, built a random forest 
regression model, and assessed accuracy using 
the testing set. The reported accuracy and mean 
squared error are averaged across 10 rounds. 

We tested for enrichment of association 
summary statistics in three types of gene sets 
(22) by applying MAGMA (version 1.09) (91), a 
method that accounts for region size, variant 
count, and LD (data S17). 
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Genetic differentiation of breeds 


We calculated genome-wide normalized PBS 
scores using the Hudson estimator of fixation 
(Fsr) for each breed (Vp > 12 dogs, maximum 
88) relative to dogs overall (Vp = 3890 — Np) 
and wolves (Nw = 48) in sliding windows of 
100 kb by 10 kb (data S18) over ~27.6 million 
SNPs from publicly available and Darwin’s 
Ark genetic data (22). After dividing locus tests 
into physical trait, behavioral question, and 
behavioral factor associations, we performed 
a one-tailed Student’s ¢ test to test whether the 
observed maximum PBS within associated loci 
exceeded what we expect by random chance 
(data S20). To test whether allele frequencies 
at SNPs associated with behavioral or physi- 
cal traits tended to differ more in breeds, we 
calculated the max Fs observed between one 
of the top 10 breeds and all other dogs and 
compared this to 29,903 randomly sampled 
SNPs using a one-sided f test. 
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What is your dog like? 

Modern domestic dog breeds are only ~160 years old and are the result of selection for specific cosmetic traits. 

To investigate how genetics aligns with breed characteristics, Morrill et al. sequenced the DNA of more than 2000 
purebred and mixed-breed dogs. These data, coupled with owner surveys, were used to map genes associated with 
behavioral and physical traits. Although many physical traits were associated with breeds, behavior was much more 
variable among individual dogs. In general, physical trait heritability was a greater predictor of breed but was not 
necessarily a predictor of breed ancestry in mutts. Among behavioral traits, biddability—how well dogs respond to 
human direction—was the most heritable by breed but varied significantly among individual dogs. Thus, dog breed is 
generally a poor predictor of individual behavior and should not be used to inform decisions relating to selection of a 
pet dog. —LMZ 
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